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Abstract 

Noise in gene expression, either due to inherent stochasticity or to varying inter- and 
intracellular environment, can generate significant cell-to-cell variability of protein 
levels in clonal populations. We present a theoretical framework, based on stochas- 
tic processes, to quantify the different sources of gene expression noise taking cell 
division explicitly into account. Analytical, time-dependent solutions for the noise 
contributions arising from the major steps involved in protein synthesis are derived. 
The analysis shows that the induction level of the activator or transcription factor 
is crucial for the characteristic signature of the dominant source of gene expression 
noise and thus bridges the gap between seemingly contradictory experimental re- 
sults. Furthermore, on the basis of experimentally measured cell distributions, our 
simulations suggest that transcription factor binding and promoter activation can 
be modelled independently of each other with sufficient accuracy. 
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Introduction 



Within a genetically identical population, individual cells show significant phenotypic 
heterogeneity [3; [s). This variability directly afi'ects the cell's ability to respond to en- 
vironmental factors like changes in ligand concentration. Especially, reactions underlying 
protein synthesis are often based on small numbers of molecules, like transcription factors 
or ribosomes, such that stochastic fluctuations have to be taken into account. 
A lot of effort has been undertaken to quantify the origins of gene expression noise ex- 
perimentally and theoretically. Stochasticity or noise inherent to gene expression seems to 
be one of the main driving forces for the observed cell-to-cell variability in several experi- 
ments which have measured the variance in protein abundances in different cellular systems 
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18l ). Considerable confusion stems from diverg- 



ing experimental results which have identifled different origins for the main contribution to 
gene expression noise jiol ) such that a complete picture is still missing. For prokaryotes, 
translational efficiency was identified as the main source of variability of expression levels 
consistent with a stochastic model in which proteins are produced in sharp and random 
bursts (1201 ). However, later experimental observations in individual living cells either by 
measuring mRNA levels or by real-time observations at single molecule level indicated that 



promoter activation predominantly causes gene expression noise ((21 



22l ). Furthermore, ex- 



trinsic factors, like the cellular state, were also identified to give the main contribution to 
phenotypic variations in a clonal population (IigI ). Similar contradictory results have been 
found in eukaryotes, where in the budding yeast Saccharomyces cerevisiae a two-reporter 
system, expressing two fiuorescent proteins from identical promoters, identified switching 
between active and inactive promoter states due to slow stochastic chromatin-remodelling 

n 

events as the by far largest source of noise ()5|). In later in-vivo experiments it was shown 
for a large set of genes at their native expression levels that the noise has a clear sign of 
transcriptional origin due to low-copy mRNA molecules (Q; fl^ . Moreover, a direct moni- 
toring of mRNA production from a gene at the resolution of single molecules in mammals 
revealed strong mRNA bursts dominating gene expression noise (9y. In contrast, for human 
cells, genes at native induction level showed significant noise contribution from long-term 
variations of the cellular state (l3)- It seems on first sight that no general rule can be 
given to determine the main sources of gene expression noise. Protein levels, however, are 
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often strongly optimized, because they have to allow for precise and reliable information 
processing within the cell. Any significant deviation from the optimal level would result 
in reduction of fitness and an evolutionary disadvantage. Thus, random fluctuations are in 
general detrimental for cellular systems and several regulatory mechanisms have evolved to 
minimize them. Only in rare cases noise can be used to drive phenotypic switching provid- 
ing a non-genetic mechanism to population heterogeneity, as found for bacterial per sistence 
against antibiotics (2^ and competence for DNA uptake from the environment ^2^. 
In order to track down the individual contributions of the molecular mechanisms involved in 



protein synthesis several mathematical models have been introduced (125 
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. Some of these models ignore the effect of binomial partitioning by cell division which 



will lead to strong discrepancies to experiments whenever cellular mRNA is long-lived and 



appears in low copy number (129 



30 



mRNA bursts (25 



3ll ). whereas others lack the dynamic description of 



271 ). In the present work we develop an analytical framework which allows 



for a time-dependent description of gene expression and accounts for effects of symmetric cell 
division. We consider a one- gene-system consisting of activator/transcription factor (TF) 
binding (repressor unbinding), promoter activation, transcription, and translation (Fig. [1]). 
All gene specific events contribute to the so-called intrinsic noise. Differences between cells, 
either in global cellular state or in the concentration or activity of any factor that affects 
gene expression are referred to as extrinsic noise (jj). Therefore, the cell-to-cell variability of 
a specific protein in a large clonal population with fixed generation time is characterized by 
the two contributions of intrinsic and extrinsic noise, summing up to the overall variance of 
the protein. Assuming no specific feedback of a produced protein on upstream processes, the 
intrinsic noise contribution decomposes into partial contributions stemming from activator 
binding, promoter activation, transcription and translation. In deriving analytical expres- 
sions for these partial contributions to gene expression noise, we discuss limiting cases for 
mRNA and protein lifetimes. We show that the magnitudes of the different noise contribu- 
tions depend strongly on induction level, synthesis rates, and molecule lifetimes associated 
with each individual gene. Therefore, differences in the induction level, e.g., due to different 
experimental set-ups, might provide a possible explanation for the diverging experimental 
findings of cell-to-cell variations even in the same organisms. Furthermore, on the basis of 
experimentally measured cell distributions of wild-type and over-expressed cells of E. coli, 
our simulations propose that, in prokaryotes, activator binding and promoter activation are 
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independent of each other and thus can be modelled for a good approximation separately. 



Results 

Minimal model for gene expression noise 



Four major steps are involved in a generic model of gene expression in living cells (130| ): 



(i) activator/TF binding (repressor unbinding), (ii) promoter (DNA) activation, (iii) tran- 



scription, and (iv) translation. In the present work we follow a previous approach (l30l ) 
and model synthesis and degradation of mRNA and protein by a birth- and- death process. 
Activator binding and promoter activation are described as a random telegraph process, 
because they are assumed to switch randomly between zero and one with exponentially dis- 



tributed waiting times (|9|; |2l| ). The state of the activator is given by the stochastic variable 
B{t) switching between B{t) = 1 and B{t) = 0, if the activator is bound or unbound, re- 
spectively. Promoter activation can be expected to occur on much slower time scales than 



activator binding (12 ll : l22l : l32l : l33l ) such that the time scales can be separated. Therefore, 
we approximate activator binding, B{t), as an equilibrated binding process. The Master 
equation for the probability to find a promoter in its active, P{l,t), or inactive, P(0,t), 
state is given by 

dtPil,t) = -\^Pil,t) + \\PiO,t), (1) 

where represents the switching rate from the inactive to the active state and the rate 
for the inverse process. We define 

7A = A+ + A;^ (2) 

and denote the initial state of the promoter at time to by Oq. The solution for ([1]) with 
initial conditions P(l, tolao, ^o) = ^i,ao reads: 

P(l,t|ao,to) = ^ + - —) e-^-(*-*°). (3) 

We can assign the stochastic variable A(t) G {0, 1} to describe the state of the promoter. The 
process of promoter activation seems to have no significant correlations with the cell cycle 
21). Therefore, we can employ a stationary solution for the auto-covariance function 

{A{t)A{t')) - {A{t)){A{t')) = mr(A)e-^^l*-*'l (4) 



with var{A) = and (A) = X\/'yA- The mean {A) can be interpreted as a measure 

of the fraction a promoter spends in its active state for which holds < (A) < 1 . 
Transcription and translation are modelled as a birth- and- death process. The corresponding 
mRNA and protein trajectories are denoted by R{t) and X{t), respectively. For the condi- 
tional probability P{ni,t\n'-,to) to observe rii molecules {rii = R,n2 = X) in a given cell at 
time t we obtain the stochastic Master equation 

dtP{ni,t\n'i,to) = X+{t)P{n,-l,t\n',,to) 

+ {ni + l)XrP{ni + l,t\n[,to) 

- (Arn, + A+(t))P(n,,t|n:,to) (5) 

where n'^ molecules are observed initially at time to- Synthesis rates are given by A^(t) = 
\'^A(t)B{t) for the mRNA or A^(t) = Aj^-R(t) for the proteins. The corresponding degrada- 
tion rates, A|f = and A2 = A^^ , are assumed to be independent of molecule number and 
constant in time. Furthermore, the trajectories A{t) and B{t) are assumed to be indepen- 
dent of each other which will be justified below. Transcription at time t is only possible if 
the activator is bound and the promoter is in the active state, A{t)B{t) = 1, cf. Fig. [H We 
assume that the translation rate of specific proteins depends linearly on the actual amount 
of the corresponding mRNA, R{t). 

A lot of effort has been undertaken to generate time resolved expression data of single 



cells (1151). The mathematical description of gene expression becomes more complicated if cell 
division is included, because cell division introduces another important time scale into the 
system, the generation time Tq- Previous theoretical approaches avoided this problem by 
assumin g an increased protein degradation rate instead of explicitly taking cell division into 
account (:2y). Such an approximation might be justified for systems where effects of binomial 
partitioning can be neglected and cellular growth and protein synthesis scale linearly in 
time. In this case the concentrations - but not the absolute copy number of molecules - are 
approximately constant over cell cycles. However, one should note that experiments measure 
the total molecule number instead of concentrations. Measurement of concentrations would 
require precise knowledge of the individual cell volume at any time. Recent experiments 
show that the main effect of cell division seems to be binomial partitioning of molecules 



(16 



21I ) such that the assumption of an increased protein degradation does not hold. 



If we assume symmetric cell division at time to with f < to < t and a binomial distribution 



of the molecules, the following relationship for the conditional probability P{ni,t\n'^, t') holds: 

(6) 



P{nut\n'J) = Yl 5^ ^K,t|n+ to)r+ l2-">P(n^,^o|<,^')• 
r^-=on+=o ^^^^ 

The amounts of molecules before and after the last cell division occurring at time to are 
represented by n~ and n^, respectively. To solve Eqn. ([5]) we use the generating function 
G{s,t\n'^,t') = X]^=o ■^"'-^(^«' ^l^i' ^^"^ assume a finite number of cell divisions between 
t' and t. After some algebra we arrive (for derivation see SI) at: 



G{s,t\n[,t') = [2-^(*'*')(s - l)e-^»" + 1 



(7) 



X exp 



(s-l) / A+(t")e-"»'(*-*")2-^(*'*")cit" 
Jt' 



where D{t,t') is the number of cell divisions that have occurred between two time points 
if: > t'. The mean amount of molecules at time t > t' can be calculated via dsG\s=i. If we 
assume that initially the process is Poissonian-distributed the mean amount of molecules is 
given by 

{n,(t)), = (n,(t')).2-^(*''*)e~^«" + rA+(t")e-"n*-*")2'^(*'*")(it" (8) 

Jt' 

where the average over i? or X is denoted by ( . )j. We define the generating function 
6(s,t|n^to) = X]l^=o '^"'-^('^*' ^l^i' ^o) and use Eqn. (IT!) to obtain the auto-correlation func- 
tion ( (llTI )) via dsQ\s=i (for detailed derivation see SI): 



in. 



:(t),n.(t')). = (n.(0).2-^(*'*')e-^r 



(9) 



Eqn. (Q shows that each cell division halves the magnitude of the auto-correlation function. 
Replacing nj by i? or X, we obtain the mean amount and the auto-correlation function of 
the mRNA or the protein from Eqns. ([8]) and ([9]), respectively. 



Partial contributions to gene expression noise 

The heterogeneity in gene expression of a population can be quantified using the standard 
deviation (a) divided by the mean (/x), i.e., 1] = a/fi. The quantity rj is commonly denoted 
as noise and provides a physiologically relevant measure of gene expression variability as it 
quantifies relative fluctuations independent of the expression level. Two main contributions 
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to the overall variance, a^^f-, determine the cell-to-cell variability in the amount of a protein, 
X: the intrinsic variance, aj, which is distinctive for each gene in its genomic context and an 
extrinsic noise contribution stemming from the variance a]^, which is due to fluctuations in 
the intra-cellular environment. If we do not assume any significant feedback of the expressed 
protein on extrinsic factors (cf. Fig. [T] and (134)). the overall variance in a large clonal 
population of cells with fixed generation time Tq sums up to: 

cTL = aj + al (10) 

The extrinsic contributions are usually separated into fluctuations of upstream factors 
that drive expression directly, e.g., a given activator concentration and the cellular state 
that influences gene expression, e.g., via variations in polymerase, ribosome, and pro- 
tease concentrations. The average over all possible trajectories of protein copy number 
X{t), mRNA copy number R{t), promoter activation A{t), and activator binding B{t) is 
defined by ( . )i = / . P[X{t); R{t); A{t); B{t)]dX{t)dR{t)dA{t)dB{t) with the multi-index 
I = {X,R,A,B), using e.g., {.)x = J . P[X{t)\R{t)]dX{t). Note that P[X;R;A;B] can 
be written as P[X|i?]P[_R|A, _B]P[yl]P[i?]. We can identify different contributions to the 
intrinsic noise of protein synthesis: 

a] = ((X(t) - (X(t))x,i?,A,i^)'>x,R,A,B 

= {{Xit) - {X{t))xf)^^^^^^^ 

+ {{{X{t))x - {X{t))x,R?)^^^^^ 

+ {{{X{t))x,H-{X{t))x,n,A?)^^^ 

+ {i{Xit))x,R,A - {Xit))x,R,A,Bf)^ (11) 

= al + al + al + al, (12) 

where the right-hand-side denotes the sum over variances (for derivation see SI) correspond- 
ing to the processes of activator binding (a^), promoter activation {cr\), transcription (ct|j), 
and translation {cx), respectively. The summing up of the individual variances is only 
possible if there are not any feedbacks from downstream to upstream processes. We find 
for the noise contribution due to translation: rjx = (y\/{X)\ = l/{X)i. Also, the binomial 
distribution of the proteins caused by cell division converges quite rapidly to a Gaussian 
distribution for an increased amount of molecules. The amount of protein synthesized 
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per mRNA can be estimated to be of the order 10 — 10^ for most systems. Thus, for the 
experiments considered in this work, we can neglect translational noise in comparison to 
other noise contributions. 



In order to make the analytical calculations feasible, we assume a fixed induction level of 
the activator, B{t) = B^q. Due to the separation of the time scales (fast activator binding 
and very slow promoter activation), fluctuations from activator binding can be neglected 
compared to fluctuations from promoter activation and transcription such that the averaging 
over B cancels out in Eqns. ( ITT]) . Therefore, noise contribution from activator binding, cr^, 
is not present and we set I = (X, i?, A). Explicit expressions for the different intrinsic 
noise contributions of promoter activation, and transcription, cr|., can be obtained from 
Eqns. dl]),® and ([9]) (see SI for a detailed derivation). We introduce the abbreviations 



A = {A)B,q\+\\ 



(13) 



representing the average acceleration of protein synthesis in absence of any degradation, and 

1 - exp[-ARTG] 



G) 



(14) 



l_2-iexp[-A«TG] 

Assuming infinitely many cell divisions and averaging over all intrinsic processes yields for 
the mean amount of protein and mRNA: 



W-to))i = ^ 
1 - 



R 



^R ~ ^X 



-X^it-to) _ „-A^(t-to) 



A 



A 



R 



^X 



1 - e'^x 



R 



^R " -^X 



'A 



R 



{R{t - to))i = {A)B,q-^ 1 - (1 - 2-iZ^,)e-^H(*-*o) 



(15) 



(16) 



The terms involving the generation time, Tq, reflect the memory of the contributions 
that have been generated in previous generations and have been passed over to the actual 
generation. Since we account for cell division, the results can be directly compared to 
time-resolved expression data of protein levels. 

In the following we discuss three important physiological limiting cases in detail. The 
derived asymptotic expressions for the mean and the variances are valid for any time t 
within a given cell cycle, < t — < Tq, where to denotes the time point of the last cell 
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division. In the first two cases we omit the exphcit formulas for the corresponding noise 
contributions rj = a/fi, since these formulas are easily derived but do not contain new 
information with respect to the derived mean and variances. 



Case I: Short mRNA lifetimes and long protein lifetimes, (A^) ^ ^ Tq i^x) ^- This is the 



most likely physiological case. We obtain for t — to, Tq ^ {Xji 
expressions for the average amount of protein and mRNA: 

(X(t-to))i = ^[it-to)+TG] 



,-1 



(i?(t-to))] 



^R 



the following asymptotic 

(17) 
(18) 



Note that Eqn. f[T7j) implies a linear increase of the mean amount of protein in time as 
well as a doubling of protein synthesized over one cell cycle, {X{Tg))i = 2(X(to))i- The 
stationary mRNA level is recovered immediately after cell division. The noise contributions 
from transcription and promoter activation are also time-dependent (for derivation see SI) 



and read in the limit t — to, Tq ^ {Xj^ 
alit - to) 
a\{t-to) 



2A- 



(A] 



t-to + -Tg 



2A 



2var{A) 



[t - to) + -Tg 



(19) 
(20) 



Case IT. Long mRNA lifetimes and long protein lifetimes, (A^)^^, (Aj^ » Tg- Molecule life- 
times are significantly larger than the generation time and the switching rate of promoter 
activation, which result in the asymptotic expressions for the means: 

1 



{X{t~to))i = -A[{t-to)' + T'a] 
{R{t~to))i = {A)B,gX+[t-to + TG] 



(21) 
(22) 



The mean amount of mRNA increases linearly whereas the mean amount of protein increases 
quadratically within one cell cycle. Both the mRNA and the protein synthesized over one 
cell cycle are doubled. In this asymptotic case, dilution due to cell division is the only 
mechanism which determines the correlation times. Regarding transcriptional noise, we find 
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in the limit t — tQ^Tc ^ 7^^ strong contributions from previous generations, reflected by 
several terms involving Tq'- 



^lit - to) = IaX+ 



{t-t,f + Tl{t-t,) + -T, 



G 



(23) 



Noise due to promoter activation in the limit t — to; ^ 7a^ takes the functional form: 

<y\{t - to) = Ia'^-^- [it - to)' + Ta{t- to)^ + 12T^(t - to) + 14T^] . (24) 
Experimentally one could test this case using a set-up with artificially stabilized mRNA. 



Case 111: Short mRNA lifetimes and short protein hfetimes, (A^J)"^, (A^^)"^ ^ Tq- In the ex- 
ceptional case that both protein and mRNA lifetimes are significantly shorter than the 
generation time, memory over generations is eliminated and the stationary protein level is 
recovered immediately after cell division. The solutions are consequently time-independent 



and agree with those found earlier by Paulsson (l29l : l30l ) for the mean amount of protein and 
mRNA: 



A 



■^R 



(25) 
(26) 



and the variances 



a 



R 



4 



A 



A 



X 



AxAfl(A^ + \x) 
A'^var{A) 



1 + 



A 



R 



(27) 
(28) 



(^)'ArA3^(7a + A^)(A^ + A^) V ' lA + Xx. 
The noise contributions from transcription and promoter activation to gene expression noise 
are given by 

1 A^ 



^x 



{R)iXn + X 



^x 

2 _ var{A) A^ 
Va 



A 



X 



1 



A^ 



7a + A 



X 



(29) 
(30) 



and have been calculated by Paulsson as the normalized stationary variance with the same 
result (Isol), Eqn. (4)). 
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Noise regimes account for different experimental observations 



'. Different origins o 



(B; S S 



16 



"noise fiave been proposed and measured by several experimental groups 



20 



22I). Recent 



V Kaufmann and van Oudenaarden critically re- 



viewed these experimental observations (jl9l). The contradictory results, even in the same 



eukaryotic organism S. cerevisiae, support the idea that gene expression is influenced by 
more than one main driving source. In the following we focus on the budding yeast 5*. 
cerevisiae and the contradictory experimental results found by Bar-Even et al., Newman et 



al. and Raser and O'Shea p; [Z; 



14j ). Bar- Even et al. as well as Raser and O'Shea devel- 



oped the same mathematical model to describe gene expression: it contains the processes of 
gene/promoter activation, transcription and translation (see Supporting Online Materials 
of (j5|) and (j7|)). However, the authors interpreted the theoretical results according to their 
experimental observations. Raser and O'Shea (|5!) measured the intrinsic noise strength of 
the PH05 and PH084 promoters at different rates of gene expression in promoter con- 
structs. They concluded that the noise intrinsic to gene expression is promoter-specific: 



noise generation at the PH05 promoter depends on stochastic promoter activation due to 
chromatin remodelling. In contrast. Bar- Even et al. (j?) investigated native expression of 43 
genes in each of 11 conditions, whereas Newman et al. (jl^ presented an extensive overview 
of protein noise for more than 2500 proteins expressed from their endogenous promoter and 
natural chromosomal position by the use of a combination of high-throughput flow cytom- 
etry and a library of GFP-tagged yeast strains. Both latter studies concluded that random 
hirth- and- death of low-copy mRNA molecules describe the large observed variations quite 
well: for the great majority of proteins the noise level is inversely proportional to the mean 
protein abundance implying a clear signature of a Poisson process. The obvious question 
arises: what is the predominant source of noise: promoter activation, as suggested by Raser 
and O'Shea, or mRNA fluctuations due to low copy number as proposed by Bar- Even et 
al. and Newman et al.? An explanation for these contradictory experimental results is 
given by our stochastic model considering activator binding explicitly. For approximately 
constant activator concentration and high amount of protein synthesized per mRNA, the 
intrinsic variance af of Eqn. (fT2l) reduces to cr| = (y\ + a\. Hence the ratio ct'a/'^r determines 
the predominant source of noise: if <y\/a'^ ^ 1, promoter activation will be the dominant 
process while in the case of <y\/(j\ <^ 1 the major part of gene expression noise is due to 
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transcription. For long protein lifetimes and short mRNA lifetimes (Eqns. (fT9ll . (l20l) ) the 
ratio is given by 

a|_ var{A) _ A^A+ 

Note that this ratio is time- and cell cycle-independent although a^, Eqn. f|T9l) . and 0"^, 
Eqn. (!20l) . both depend on the cell cycle time, t — to, and generation time, To- For short 
mRNA and short protein lifetimes we obtain from Eqns. fl27j) . fl28|) the following ratio: 



al _ var{A) A+ / A 



0" 



If we subsequently assume, as it has been done by Bar-Even et al. and Raser and O'Shea, 
that protein lifetimes are longer, such that switching between promoter states occurs more 
frequently than a protein degradation event, A]^ ^ 7a ^ A^, Eqn. (15^ reduces to Eqn. (pTi) . 
It follows from Eqns. ( 13T|) and ( l32l) that for fixed rates the probability of activator binding, 
Beq, determines the value of the ratio cr\/aj^, i.e. the induction level of the activator selects 
the predominant source of noise. Thus, we expect for highly expressed genes, B^g 1, 
to show signature of noise from promoter activation provided A^Ajj > 7^ since in this 
case we find o'a/'^r > 1- contrast, we expect for low induced genes, B^q ^ 1, to show 
signature of Poissonian noise from mRNA synthesis, since in this case cr\/a'^ < 1 holds given 
AaA/j/tI is not too large. Therefore, the induction level of the activator, B^q, provides an 
excellent explanation for the observation of different noise contributions even in the same 
organism. In fact, activator induction is expected to be quite low for experiments with 



native genes ([9|; Il4l ). Bar- Even et al. investigated native genes implying a large set of 
low induced genes, B^q ^ 1, such that transcription is the prevailing source of noise. In 
contrast, Raser and O'Shea constructed yeast strains that expressed CFP and GFP proteins 
from identical promoters. In constructs activator induction is very high, B^q 1, such 
that we expect that promoter activation noise is the dominant noise contribution. In Fig. [2] 
we present the mean protein abundance vs. noise. In order to select arbitrary time points 
t within a given cell cycle, < t — to < Tq, we use the full expressions for the mean 
amount of protein, Eqn. (IT^ . and noise contributions (see SI), because the approximations 
presented in Eqns. f lTTl) . (HM and fl2U]) are only vahd in the asymptotic limit t — to ^ (^r)^^- 
We calculate the mean amount of protein and the noise contributions for several genes at 
randomly selected time points for several induction levels of the activator binding Beq. In 
Fig. [2JA. we assume a low induction of the activator, where the mean induction level Beq 

12 



equals 0.07, and therefore mimics the experimental set-up of Bar-Even et al. The noise 
contribution arising from transcription (blue circles) dominates the overall noise (magenta 
diamonds). In Fig. [23, however, noise from promoter activation (green squares) overrules 
noise from transcription. This can be arranged with an highly induced activator, with mean 
induction level B^q = 0.7, which mimics the experimental set-up of Raser and O'Shea. We 
conclude that both experimental scenarios can be qualitatively reproduced very well with 
our stochastic model by varying the induction level of the activator binding. 



a 



Activator binding and promoter activation determine population distribution 

In eukaryotes, promoter activation is believed to occur due to chromatin remodelling 



(l5|) which erratically uncovers transcription-factor binding sites. Activator binding, 
however, is assumed to be quite fast and frequent, because of the high copy number of 
TFs. Therefore, independence of activator binding and promoter activation seems to be 
a reasonable assumption in eukaryotes. For prokaryotes, the situation is less clear since 
a possible explanation or mechanism for the slow process of promoter activation is still 



lacking, although it has been measured quite accurately fl21l : |22| ) . In recent experiments 
Elf et al. measured the time scale for the binding/unbinding of an activator/TF at the 
single-molecule level in a living cell of E. coli. The binding/unbinding of highly abundant 
TFs is also suggested to be quite fast (33:). Therefore, activator binding does not seem to 
be the limiting step within the process of gene expression. In order to gain insight into 
the influence of the activator binding on promoter activation in prokaryotes, we compare 
experimental data with simulations. Kollmann et al. (see (1351), Fig. 2a; redrawn in Fig. [3j\, 
inset), compared the mean expression of CheY in a wild-type of E. coli and flgM cells, 
where the upstream transcription inhibitor, FlgM, was deleted. The deletion corresponds 
to a sevenfold over expression of CheY. Several effects of an activator /repressor on the 
activation of the promoter are possible. We discuss the three most intuitive scenarios: 

1. Activator binding and promoter activation are independent: The RNA-polymerase can 
start transcription if and only if the activator is bound (repressor is unbound) and the 
promoter is active. 

2. Activator binding enhances promoter switch- on rate: For an experimentally observed 
sevenfold over expression we assume that the switch-on rate X\ of the over expression of 
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CheY is enhanced compared to that of the wild-type. The switch-off rate is not affected. 
3. Activator binding decreases promoter switch-off rate: For an experimentally observed 
sevenfold over expression we assume that the switch-off rate of the over expression is 
decreased compared to that of the wild-type. The switch-on rate X\ is not affected. 
Of course, combinations of the mentioned scenarios are possible and likely to occur in 
nature. However, to keep the estimated parameters identifiable, we only focus on these 
three limiting scenarios. 

The experimental data shows that the mean protein level of the over expressing flgM cells 
is sevenfold higher than that of the wild-type cells. Furthermore, the standard deviation of 
the population distribution for the flgM cells increases quite significantly compared to the 
wild- type cells (Fig. [3]A. (inset) and Tabled]). Fig. [3]A. shows the population distributions 
of the wild-type and flgM cells for the different scenarios after parameter estimation (see 
Material and Methods). The parameters are estimated such that the wild-type standard 
deviation and the mean fluorescence level of the flgM cells are represented best. The 
estimated parameters {Beq for the first scenario, A^, A^ and the level of the over expression 
OE for second and third scenario) and the corresponding characteristics of the population 
distributions are summarized in Table [TJ 

The simulations reveal that there exits a set of parameters for the first scenario, where the 
activator binding does not influence the promoter activation process directly, such that 
the characteristic standard deviation of the wild-type and the mean fluorescence level of 
the flgM cells is reproduced very well {x^ = 0.004). Furthermore, it also mimics (without 
any optimization) the increased standard deviation of the flgM cells (cf. Table [U and Fig. 
|3j\, red line denotes wild-type, black line denotes sevenfold over expression). The mean 
activator binding for the flgM cells is 7.3 times larger than that of the wild- type cells 
{Beq^WT = 0.13, B^q OE = 0.95) which leads to an about sevenfold protein over expression of 
the mean fluorescence level. For the second scenario, where activator binding and promoter 
activation are not independent of each other, but activator binding enhances the promoter 
switch-on rate, the simulations with the estimated parameters does not represent the 
characteristic standard deviation of the wild-type and the mean fluorescence level of the 
flgM cells equivalently well (x^ = 0.67, Fig. [3JA- green and blue lines denote wild-type and 
flgM cells, resp.). The standard deviation of the flgM cells becomes much larger than the 
experimental one. The estimated parameters are given by \\ = 0.005 for the wild-type 
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switch-on rate and OE = 413 for the increased multiphcation factor for the flgM cells. 
The promoter is switched-on in 5% of the time for wild-type cells and 95% of the time 
for flgM cells. For the third scenario, where the binding of the activator decreases the 
promoter switch-off rate, a set of parameters can be found such that both characteristics 
are reproduced {x^ = 0.04, Fig. [3JA. cyan and magenta dashed lines represent wild-type 
and flgM cells, resp.). The standard deviation of the flgM cells is also increased quite 
well. The resulting parameters = 0.39 for the wild-type switch-off rate and OE = 924 
for the reduction factor for the flgM cells imply that the promoter is switched-off 90% 
of the time for wild- type and 1% of the time for flgM cells. If we compare the resulting 
skewness of the flgM cells in each scenario with the experimentally measured one we find 
that the second scenario has the most positive skewness, but this scenario does not fit the 
characteristics quite well. For the first scenario, the characteristics are represented very 
well and the skewness is also increased compared to the third scenario. We conclude that 
activator binding and promoter activation can be considered to good approximation as 
independent processes in prokaryotes. Of course, this hypothesis has to be investigated in 
further experiments. 

Recently, stochastic dynamics has been linked to population distributions implying the 
classical model of burst-like transcription and translation (Jsi^)- A Gamma-distribution fitted 
the stochastic simulations well and reproduced specific shapes of the population distribution 
at steady-state. 

Here, we investigate the influence of the ratio n = Aj^/A^ of the promoter switching rates 
on the shape of the population distribution. Simulations of the model presented in Fig. [1] 
reveal that different cell distribution shapes can be generated for a fixed switch-off A^ rate 
by varying the switch on rate X\. If the promoter activation is much smaller than its 
inactivation, k <^ 1, the resulting protein distribution peaks at zero (Fig. [3]B, blue line). 
For appropriate promoter activation rate, the protein distribution seems to be log-normally 
distributed (green line). If the promoter activation rate exceeds the inactivation rate, k > 1, 
the protein distribution will be shifted to a normal distribution (magenta). The inset of 
Fig. [SfB shows the corresponding protein trajectories of the different switch-on rates. The 
remaining reaction rates of activator binding, transcription and translation have an impact 
on the mean amount of protein or the noise strength, but not on the shape of the protein 
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distribution itself. If, however, the time scale of promoter activation is much faster than 
that of activator binding, e.g., for strongly repressed genes, the shape of the distribution can 
also be influenced by strong fluctuations in the activator concentration, i.e., the roles of the 
activator binding and promoter activation are interchanged. Thus, the results of Fig. [3]B can 
also be obtained by fixing the promoter to its on state and choosing appropriate parameters 
for the activator binding. 



Discussion 



Different experiments have identified different causes for the main contribution to gene 
expression noise. This implies that there might be no general rule for the main source of 
noise or comprehensive knowledge of the overall noise architecture. However, the theoretical 
framework presented in this study allows to quantify the relative contributions of the 
different sources of gene expression noise. Taking cell division explicitly into account we 
derive time-dependent, non-equilibrium solutions for the mean amount of mRNA and 
protein as well as for the noise contributions from promoter activation and transcription. 
In order to interpret the analytical results with respect to their biological relevance, we 
discuss asymptotic cases for the molecule lifetimes representing essential physiological cases. 
Our analysis confirms the intuition that molecule lifetimes, compared to generation time, 
determine the influence of the noise contributions from previous generations on the overall 
noise level of the actual generation. Long-term memory effects and noise accumulation from 
previous generations might become important if molecule lifetimes are much larger than 
the generation time. In addition, the main contribution to the cell-to-cell variation within 
a clonal population depends strongly on the kinetic rates associated with the expression of 
each individual gene. We show that the induction level of the activator or TF binding, Bgg, 
determines crucially the ratio of noise from promoter activation to noise from transcription, 
cr^/cr|j,, and thus the dominant source of noise. It follows that the experimental set-up 
for the same organism S. cerevisiae of in-vivo experiments performed by Bar-Even et al. 
(7) and in-vitro experiments by Raser and O'Shea (^, plays a fundamental role for the 
experimentally measured noise level. Low induced genes (r) bear clear transcriptional noise 
signature due to low-copy number of mRNA molecules, whereas highly induced genes (l5|) 
show typical characteristics of noise stemming from promoter activation (Fig. [21). Therefore, 



16 



the proposed model provides an explanation for these contradictory experimental results 
since it is able to reproduce both results for the same organism S. cerevisiae. Additionally, 
this suggests that the incorporation of an activator binding process acting independent of 
the promoter activation process is important for a theoretical description of gene expression. 
In eukaryotes, independence of activator binding and promoter activation is a reasonable 
assumption whereas in prokaryotes the situation is less clear. Based on published, exper- 
imentally measured cell distribution of wild-type and flgM cells of E. coli, we performed 
parameter estimation with the proposed model to discriminate between limiting cases of 
the effect of activator binding on promoter activation. Simulations reveal that for the 
first and third scenario a set of parameters can be found such that the characteristic 
standard deviation of the wild-type cells and mean fluorescence level of the flgM cells can 
be reproduced quite well. An increased standard deviation for the flgM cells can also 
be observed in both scenarios such that a qualitative distinction between both scenarios 
based on simulations seems to be difficult. The biological interpretation of the estimated 
parameters, however, suggest that the first scenario, i.e., the independence of promoter 
activation and activator binding, is more likely since the cellular effort of a sevenfold 
increase of the mean activator binding B^q is reasonable. If the binding of the activator 
increases/reduces the promoter switch-on/-off rate, in the second and third scenario, 
respectively, the promoter is switched-off most of the time (95%/90%) for the wild-type. 
However, these delay times for the protein production, even for a repressed gene, contradict 
the experimental observations in E. coli where proteins are produced quite continuously 
(pcrs. communication V. Sourjik). Since the protein level should be strongly optimized to 
allow for reliable information processing, it seems to be very unlikely that promoters have 
evolved which are so strongly repressed that they are switched-off nearly all of the time. 
Furthermore, the over expression factor OE is estimated to be very large in the second 
{OE — 413) and third {OE — 924) scenario. This implies that an experimental depletion of 
an upstream inhibitor leading to a sevenfold increased mean fluorescence level corresponds 
to a theoretical Ri400/900fold increase/decrease of the switch-on/-off rate. The cellular 
effort to achieve this is expected to be very high, such that these scenarios seem to be 
inefficient. 

The increased skewness (skewness of the flgM cells of about 1.74) observed by KoUmann et 
al. in experiments is not reflected in any scenario of our underlying model. The proposed 
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model is a reduced description of the overall system and does not take into account the 
complex fiagella network such that an entire coincidence of the experiments and simulations 
is not expected. The skewness might also be influenced by external factors, like variations 
in ribosome or polymerase concentrations or by feedbacks of downstream to upstream 
processes, but none of these features are explicitly included in the present model. 
Thus, the first scenario reproduces the experimental observations quite well and the 
estimated parameters can also be interpreted biologically reasonable. This observation 
implies that the promoter activation rates might be an intrinsic property of the biological 
system, both for the wild-type and the over expression line. One way to regulate gene 
expression is to finetune and control the mean binding of the activator. 
If the time scales of activator binding and promoter activation can be separated, the ratio 
between the promoter activation and inactivation rate determines the shape of the popula- 
tion distribution (Fig. [SfB). This suggests that the often interpreted log- normal distribution 
of single cells does not result from consecutively multiplicative stochastic processes but 
rather reflects the switching rates of the slowest process within gene expression. 



Materials and Methods 



Simulating stochastic processes. We assume that activator binding and promoter activation 
can be described by a random telegraph process with transition rates A^, (activator binding), 
X\ and A^ (promoter activation). The initial state of the promoter is determined by drawing a 
uniformly distributed random number (URN) r € [0, 1] and checking whether r < A^/7a, such that 
the promoter is on. Otherwise it is off in its initial state. Transcription is a birth- and- death process 
with time-dependent synthesis rate A^(t) = X~^A{t)B{t), i.e. mRNA can only be synthesized if 
the activator is bound and the promoter is in its on-state (see Results and Fig. [1]). The original 



Gillespie-algorithm (j36l ) has been refined (for review see (1371) ). but also modified and extended to 
model growing cell volume via time-dependent reaction rates (38). To determine the next time r 
of the reaction and the next reaction u itself for time-dependent reaction rates, we follow along the 
lines of Gillespie (jstj) and Lu et al. (38) and arrive at the cumulative distribution function 



F(r) = 1 — exp 



/ a,{t + T')dr' 



--■■l-Po{r). 



(33) 
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Drawing a URN ui G [0, 1] we set ui = 1 — Po{t) and obtain, since 1 — ui is also a URN, the new 
URN ui = Po{t). The stochastic time r for the next reaction to occur is obtained by inverting 
this equation. We formulate the cumulative distribution function for transcription with a time- 
dependent synthesis rate ai{t) = A^(t) = X^A{t)B{t) of mRNA. The degradation does not depend 
on time t > t with transition rate 02 = X]^R{t) where R{t) represents the actual amount of mRNA 
at time t. We find that at time t, the next stochastic time r has to satisfy 

In(ni) = -A;^ / B{t + T')A{t + T')dT' - X^R{t)T. (34) 
Jo 

Drawing a second URN U2 € [0, 1] , the next reaction fi must fulfill the inequality 

X+A{t + T)B{t + t) <U2 {X-^A{t + T)B{t + r) + X-^R{t + r)) (35) 
The modified Gillespie-algorithm with time-dependent reaction rate ai[t) determines the next 
stochastic time r in Eqn. (j34p as the upper bound of the integral. In our case, however, the 
integrand has a very special form, i.e., it is 1 if and only if the activator is bound as well as 
the promoter is on. Otherwise the integrand is 0. Therefore, the integration becomes a simple 
summation over all on-states, Or, of the product of the activator times the promoter within the 
time interval [t, t + r]. We define Ur as the ratio of the on-states to the time interval [t, t + r], i.e., 
{)<ar = Or/T<l. The next time r can therefore be calculated according to Eqn. (j34p which 
reduces for an exponentially distributed stochastic variable In(tii) =: —z to 

^ = -T TT- (36) 

X+ar + X^R{t) ^ ' 

Note that a,- depends on the single realization of A{^^B(t) and is thus also a stochastic variable. 

If A(t) and i?(t) are time- independent, e.g., A(t)B{^) = 1, it follows that Or = 1 and the orig- 
inal Gillespie-algorithm (Q) is recovered. Therefore, we use the original Gillespie-algorithm to 
determine the next time r and reaction fj, and check afterwards whether the proposed Gillespie- 
step can be performed or not. If A{t + T)B{t + t) = 1, mRNA synthesis can be realized, but 
if A(t + T)B{t + t) = and mRNA synthesis is selected as reaction /i, the step is rejected and 
new URNs are drawn. In general, this procedure will always select a stochastic time r which is 
smaller than that of the modified algorithm of Eqn. p6|) since < a,- < 1. However, the above 
procedure of taking the original Gillespie-algorithm and rejecting specific reactions is equivalent to 
the determination of the next time r via Eqn. ()36p . To obtain the same stochastic time for both 

procedures, the following equation should hold: 

^ _ A+g^ + X^Rjt) ^ 

Xl + X-^Rit) ^' ^''^ 
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where z and z are exponentially distributed variable stemming from a URN u via ln(n). This equa- 
tion is true since an exponentially distributed variable can be described by the product of a constant 
(given a specific Ur) times another exponentially distributed stochastic variable z. Therefore, tak- 
ing the original Gillespie-algorithm and rejecting specific reactions according to the time-dependent 
trajectory A{t)B{^^ is just another realization of the modified Gillespie from Eqn. ()36p and aver- 
aging over a lot of trajectories yields the same result. 

The generated mRNA trajectory R{t) can directly be used to calculate the appropriate mean pro- 
tein number {X{t))x given by Eqn. ([8]). At the end of one cell cycle, the cell divides symmetrically 
into two daughter cells. The mother's amount of protein and mRNA is divided binomially to both 
daughter cells. To demonstrate that the modified Gillespie-algorithm computes the correct solu- 
tion. Fig. SI shows a comparison of simulated mRNA and protein trajectories with the analytical 
ones from Eqns. (jlSp and (|16p . respectively. 

Generating population distributions. In order to avoid dependence on the initial conditions, 
we start with a certain amount of mRNA and proteins and simulate in total 15 generations. After 
five generations we randomly determine one cell with its amount of mRNA and protein to be the 
mother cell for the next five generations. The system is equilibrated and the actual simulations 
can be started. We randomly choose one cell to be the mother cell and generate 10 offspring 
generations (=1024 cells in the 10th generation). To compare the influence of the length of the 
generation time Tq on the population distribution, we simulated populations with fixed generation 
time and varying generation time (choosing Tq £ Af{fi,a)). We did not observe any significant 
differences between these two scenarios and therefore fix for simplicity the generation time in all 
simulations. 

Effect of activator binding on promoter activation: parameter estimation. A least-square 
fit is performed with MATLAB such that, after data normalization to mean wild-type fluorescence 
of 1, the experimental standard deviation of the wild-type cells (cr = 0.69) and mean flgM fluores- 
cence {fi = 6.96) of the population distributions are best represented. For each optimization step, 
20 X 2^ = 10240 realizations of the proposed model are generated. The following parameters are 
fixed for the simulations: = 2, = 0.2, Aj = 4 and A^ = 10~^. In the first scenario, the 
promoter switch-on/-off rates are set to X\ = 0.05 and A^ = 0.1, representing realistic kinetic rates 
for a repressed gene (2]][). The mean activator binding rates of wild-type (-Beg.iyr) and flgM cells 
{Beq,OE) are estimated separately. For the second and third scenario we assume a mean activator 
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binding of B(.q = 0.5 for the wild-type as well as for the flgM cells. The promotor switch-off rate 
is set to = 0.1 and the promoter switch-on Aj^ ^^j, for the wild- type and the strength of the 
over expression, OE, are estimated in the second scenario. If the activator is bound, the promoter 
switch-on rate for the flgM cells, AJ^q^, will be enhanced by the over expression factor OE, i.e. 
AJ[ = OE X AJ[ ^^rp. In the third scenario we set the promoter switch-on rate to X\ = 0.05 and 
estimate the promoter switch-off rate A^ ^^j, for the wild-type and the strength of the over expres- 
sion OE. If the activator is bound, the promoter switch-off rate for the flgM cells, A^ q^, will be 
reduced by the over expression factor OE, i.e. A^ = X^^^j./OE. The estimated parameters 
are summarized in Table [TJ 

Acknowledgements 

This work was financially supported by the Graduate School "Plant Signal Systems" 
(DFG grant no. GK1305 for JR) and the DFG Emmy Noether-Program (for MK). 
Competing interests The authors have declared that no competing interests exist. 
Author's contributions MK initiated and supervised the study, MK and JR carried 
out the analytical calculations, JR performed the simulations, JR, CF and MK wrote the 
manuscript with useful comments from JT. 

Abbreviations G/C/YFP, green/cyan/yellow fluorescent protein; TF, transcription factor; 
OE, over expression; URN, uniform random number 

References 

[1] Spudich JL and Koshland, DE Jr. (1976) Non-genetic individuality: chance in the single cell. 
Nature 262: 467-71. 

[2] Raser JM and O'Shea EK (2005) Noise in gene expression: Origins, consequences, and control. 
Science 309: 2010-3. 

[3] Avery SV (2006) Microbial cell individuality and the underlying sources of heterogeneity. Nat 

Rev Microbiol 4: 577-87. 
[4] Elowitz MB, Levine AJ, Siggia ED and Swain PS (2002) Stochastic gene expression in a single 

ceh. Science 297: 1183-6. 



21 



[5] Raser JM and O'Shea EK (2004) Control of stochasticity in eukaryotic gene expression. Science 
304: 1811-4. 

[6] Volfson D, Marciniak J, Blake WJ, OstrofF N, Tsimring LS and Hasty J (2006) Origins of 

extrinsic variability in eukaryotic gene expression. Nature 439: 861-4. 
[7] Bar-Even A, Paulsson J, Maheshri N, Carmi M, O'Shea EK, Pilpel Y and Barkai N (2006) 

Noise in protein expression scales with natural protein abundance. Nat Gen 38: 636-43. 
[8] Becskei A, Kaufmann BB and van Oudenaarden A (2005) Contributions of low molecule 

number and chromosomal positioning to stochastic gene expression. Nat Gen 37: 937-44. 
[9] Raj A, Peskin CS, Tranchina D, Vargas DY and Tyagi S (2006) Stochastic mRNA synthesis 

in mammalian cells. PLoS Biol 4: 1707-19. 
[10] Blake WJ, Koern M, Cantor CR and Collins JJ (2003) Noise in eurkaryotic gene expression. 

Nature 422: 633-7. 

[11] Pedraza JM and van Oudenaarden A (2005) Noise propagation in gene networks. Science 307: 
1183-6. 

[12] Golding I, and Cox EC (2006) Eukaryotic transcription: what does it mean for a gene to be 

'on'? Curr Biol 16: R371-3. 
[13] Mettetal JT, Muzzey D, Pedraza JM, Ozbudak EM and van Oudenaarden A (2006) Predicting 

stochastic gene expression dynamics in single cells. Proc Natl Acad Sci USA 103: 7304-9. 
[14] Newman JR, Ghaemmaghami S, Ihmels J, Breslow DK, Noble M, DeRisi JL and Weissman 

JS (2006) Single-cell proteomic analysis of S.cerevisiae reveals the architecture of biological 

noise. Nature 441: 840-6. 
[15] Sigal A, Milo R, Cohen A, Geva-Zatorsky N, Klein Y, Liron Y, Rosenfeld N, Danon T, Perzov 

N and Alon U (2006) Variability and memory of protein levels in human cells. Nature 444: 

643-6. 

[16] Rosenfeld N, Young JW, Alon U, Swain PS and Elowitz MB (2005) Gene regulation at the 

single-cell level. Science 307: 1962-5. 
[17] van Kampen, N G (1992) Stochastic Processes in Physics and Chemistry. North Holland. Rev. 

and enlarged edition 

[18] Colman-Lerner A, Gordon A, Serra E, Chin T, Resnekov O, Endy D, Pesce CG and Brent R 

(2005) Regulated cell-to-cell variation in a cell-fate decision system. Nature 437:699-706 
[19] Kaufmann BB and van Oudenaarden A (2007) Stochastic gene expression: from single 

22 



molecules to the proteome. Curr Opin Genet Dev 17: 1-6 
[20] Ozbudak EM, Thattai M, Kurtser I, Grossman AD and van Oudenaarden A (2002) Regulation 

of noise in the expression of a single gene. Nat Gen 31: 69-73. 
[21] Gelding I, Paulsson J, Zawilski SM and Cox EC (2005) Real-time kinetics of gene activity in 

individual bacteria. Cell 123: 1025-36. 
[22] Cai L, Friedman N and Xie XS (2006) Stochastic protein expression in individual cells at the 

single molecule level. Nature 440: 358-62. 
[23] Balaban NQ, Merrin J, Chait R, Kowalik L and Leibler S (2004) Bacterial persistence as a 

phenotypic switch. Science 305: 1622-5. 
[24] Suel GM, Garcia-Ojalvo J, Liberman LM and Elowitz MB (2006) An excitable gene regulatory 

circuit induces transient cellular differentiation. Nature 440: 545-50. 
[25] Berg OG (1978) A model for statistical fluctuations of protein numbers in a microbial popu- 
lation. J Theo Biol 71: 587-603 
[26] Thattai M and van Oudenaarden A (2001) Intrinsic noise in gene regulatory networks. Proc 

Natl Acad Sci U S A 98: 8614-9. 
[27] Swain PS, Elowitz MB and Siggia ED (2002) Intrinsic and extrinsic contributions to stochas- 

ticity in gene expression. Proc Natl Acad Sci U S A 99: 12795-800. 
[28] Swain PS (2004) Efficient attenuation of stochasticity in gene expression through post- 

transcriptional control. J Mol Biol 344: 965-76. 
[29] Paulsson J (2004) Summing up the noise in gene networks. Nature 427: 415-8. 
[30] Paulsson J (2005) Models of stochastic gene expression. Phys Life Rev 2: 157-75. 
[31] Friedman N, Cai L and Xie XS (2006) Linking Stochastic dynamics to population distribution: 

An analytical framework to gene expression. Phys Rev Lett 97: 168302. 
[32] Alon U (2006) An Introduction to Systems Biology: Design principles of biological circuits. 

Chapman & Hah/CRC. 301 p. 
[33] Elf J, Li GW and Xie XS (2007) Probing transcription factor dynamics at the single- molecule 

level in a living cell. Science 316: 1191-4 
[34] Tanase-Nicola S, Warren PB and ten Wolde PR (2006) Signal detection, modularity and the 

correlation between extrinsic and intrinsic noise in biochemical networks. Phys Rev Lett 97: 

068102-4. 

[35] Kollmann M, L0vdok L, Bartholome K, Timmer J and Sourjik V (2005) Design principles of 

23 



a bacterial signalling network. Nature 438: 504-7 
[36] Gillespie DT (1977) Exact stochastic simulation of coupled chemical reactions. J Phys Chem 
81(25): 2340-2361 

[37] Gillespie DT and Petzold LR (2006) Numerical simulation for biochemical kinetics. In: Sza- 
llasi, Z, Stelling, J, Perival, V editors. Systems Modeling in Cellular Biology, MIT press, pp. 
331-353 

[38] Lu T, Volfson D, Tsimring L and Hasty J (2004) Cellular growth and division in the Gillespie 
algorithm. Syst Biol 1(1):121-128 



24 



Extrinsic noise sources 



No feedback! 



intrinsic noise sources 
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FIG. 1 Definition and reaction scheme of single-gene reporter system (yellow box) within intracel- 
lular environment (large box). Intrinsic and extrinsic noise can only be distinguished if expression 
level of reporter system does not influence extrinsic factors. Transition rates are defined in the 
text. 
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FIG. 2 Simulated mean abundance vs. noise for different genes. Transcriptional contribution (blue 
circles), noise from promoter activation (green squares) and overall noise (magenta diamonds). 
A) Low induced activator, B^q = 0.07, leads to < 1 such that transcriptional noise domi- 

nates. B) Highly induced activator, Beq = 0.7, leads to ct^/cj^ > 1 such that noise from promoter 
activation dominates. 
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FIG. 3 A) Simulations of different effects of activator binding on promoter switcli-on/off rates 
for fixed and estimated parameters. Red and black: wild-type and flgM cells of assuming that 
promoter activation and activator binding are independent (first scenario); green and blue: wild- 
type and flgM cells assuming that the activator binding enhances promoter switch-on rate (second 
scenario); cyan and magenta dashed lines: wild- type and flgM cells assuming activator binding 
decreases promoter switch-off rate (third scenario). Means, standard deviations and estimated 
parameters are summarized in Tabled] fixed parameters are found in the Materials and Methods. 
The inset shows the experimental levels of CheY, expressed as YFP fusion from native chromosomal 



position for wild- type (red) and flgM cells (black). Redrawn from (j35l ). B) Different population 



distributions {case I), each normalized to mean 1 for varying promoter switch-on rates AJ[ at fixed 
switch-off rate = 0.1. Mean activator binding: Beq = Blue: Aj[ = 0.01; green: A^ = 0.05; 
magenta: AJ^ = 1. The inset shows corresponding simulated protein trajectories for each switch-on 



rate A^. 
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a 


skewness 


estimated 
parameter 




Experiment 


wild-type 


1 


0.69 


2.51 






flgM cells 


6.96 


3.38 


1.74 


Simulation 
1st scenario 


wild-type 


1 


0.67 


1.26 


Beg = 0.13 


0.004 


flgM cells 


7.02 


2.63 


0.44 


Beg = 0.94 


Simulation 
2nd scenario 


wild-type 


1 


1.51 


2.86 


a;^ = 0.005 


0.67 


flgM cells 


6.92 


5.25 


1.07 


OE = 413 


Simulation 
3rd scenario 


wild-type 


1 


0.63 


1.17 


A;^ = 0.39 


0.04 


flgM cells 


6.77 


2.56 


0.1 


OE = 924 



TABLE 1 Characteristic mean (/i), standard deviation (cr) and skewness for population distrr 



butions from experiments of Kollmann et al. (1351 ) and simulations using the proposed stochastic 



model. Cells in which the upstream transcription inhibitor, the anti-sigma factor FlgM, was deleted, 
are denoted by flgM cells. The deletion leads to a sevenfold over expression. Differences in the 
simulated scenarios, interpretation of the factor OE and parameter estimation are described in the 
text and in Material and Methods. 
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I. DERIVATION OF TIME-DEPENDENT SOLUTION FOR TRANSCRIPTION 

For an activated promoter, the process of transcription is assumed to be Poissonian. 
Assume that R{t) is the number of mRNA transcribed from an activated promoter 
A(t)B{t) = 1. The stochastic variables A{t),B(t) G {0, 1} denote the promoter activation 



and activator binding, respectively. We assume R{t) — ^ R{t) + 1 with time- dependent 
O ; A- _ 

^ , transition rate A^(t) = X^A(t)B{t) and for the inverse process R{t) — ^ R{t) — 1 where 



is the degradation rate. 



^ [ Assuming initially Rq molecules at time to, the stochastic Master equation for this Poisson 

o 

d 

dtP{R, t\Ro, to) = A+(t)P(i? - 1, t\Ro, to) + iR+ l)A^P(i? + 1, t\Ro, to) 



■{RX], + \+it))P{R,t\Ro,to). (SI) 



> 

OO ' To solve the equation we define the generating function 

CNj ; G{s,t\Ro,to) = ^s^P{R,t\Ro, to) such that 



dtGis,t\Ro,to) = A+(t)(s - l)Gis,t\Ro,to) - X^is - l)9,G(s, t|i?o, ^o)- (S2) 
We choose the ansatz 

G(s,t) = e("-^)"(*V(s,^) (S3) 

for which we obtain using ( ]S2ll 



dtG{s,t) = 9ta(t)(s- l)e(^-^)"(*V(s,t) + e(^-^)"Wai^(s,t) 

= A+(t)(s - l)e(^-i)"W^(s, t) - A^(s - l)e(^-i)"W [a(t)V'(s, t) + dsij{s, t)] 
= (A+(t) - X],a{t)){s - l)e(^-i)"W^(s,t) - X^{s - l)9,^(s, t)e(^-i)"«. (S4) 

Comparison of the coefficients yields 

dta{t) = A+(t) - A^a(t) (S5) 
dMs,t) = -X],{s-l)dsij{s,t). (S6) 



The solution of (IS5l) is given by 



a{t) = a{to)e-^nii-'^) + / e-^n(t-''h+it')dt' . (S7) 



to 

To determine '?/'(s,t), we introduce a new variable z := ln(s — 1) (with dz = -^ds), 
substitute s by 2; in ( IS6|) and obtain the differential equation 

dt^{z,t) = -X],dMz,t). (S8) 

Its solution is an arbitrary function F of the variable X]^t — z, so we can write ij{z^t) = 
F[e^~''*fl*] and therefore (re-substituting z by s) 

ij{s,t)=F[{s-l)e-'~^% (S9) 

It follows for the generating function 

G{s, t) = F[{s - l)e-^«*]e(^-i)"W. (SIO) 

Normalization requires G{l,t\Ro,to) = 1 and therefore F[0] = 1. 

For a complete determination of F, we have to include the initial condition P{R, 0\Ro, 0) = 
SrRo which implies 

G(s, 0|i?o, to) = V PiR, 0\Ro, 0) = = F\s - lle^'^-')"^"). (Sll) 
^ ^ ? ' 

We arrive at F[s — 1] = [{s — 1) + i]'Rog-{s-i)o(o) -^j^jcJi gives for the substitution x := s — 1 

= [x + l]-^«e-^"(°\ (S12) 
Replacing F in (ISIOI) with our fully determined F of flS12l) we obtain 



G{s, t\N, 0) = [{s - l)e-^«* + l]^exp[(s - l)(a(t) - a(0)e-^«*)] (S13) 

To start at an arbitrary time to and molecule number Rq we now include the effect of cell 
division into the generating function Eqn. (1S13P of the transcription process. It is obvious 
that the noise level strongly depends on the mean number of mRNAs. If there were many 
mRNA copies the noise from cell division would become smaller, such that the process of 
cell division would not account for the variability among daughter cells. Conversely, only a 
few copies of mRNA result in a large variability among daughter cells. Therefore this effect 



has to be taken into account when focussing on gene expression noise. 
Activator binding and promoter activation are expected to be fairly independent on cell 
division and the amount of protein is simply halved if the cell divides symmetrically as for 
most systems the copy number of proteins is > 10^. Thus transcription is the only process 
where we have to account for cell division in the probability density function. 

Assuming exactly one symmetric cell division with binomial distribution of the mRNA 
at time ti within to < t2, the expression for the conditional probability density function of 
mRNA synthesis and decay is given by 

PiR^,t,\R^,to)= f2 5^ P(i^2~,^2|i^^tl)Q+)2-^^p(i^^,tl|i^o'','^o), (sm) 

R-=OR+=0 

with Rl and Ri is the amount of mRNA of the daughter and mother cell, respectively. 
It can be easily seen via induction over the number of cell divisions, that the probability 
density function of mRNA synthesis and decay after n cell divisions at ti, . . . , t„ reads with 
tn+i > t > tn and ti > f > to 

oo 



P{R,t\R',t') = 5^ P(P,t|i?+,t„)L;j2-^"P(p-,t„|i?',t') 

= E E^(^'^i^-^n)Qu2-^^-p(pr,tiiP',o 

R-=OR+=0 R-=OR+=Q ^ 

^ n [^?)^~''^PiR7MRti,U-i). (S15) 



The generating function corresponding to one cell division at ti is given by 
Gis2,t2\Ro,to) = 5^ ^(P2,t2|P^,ti)Q+^2"^^P(Pr,ti|Po,to) (S16) 
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X 



J2 [(^2 - l)e-^«(*2-*i) + 1]^^ exp (S2 - 1) / ' e-^^^'^-''h+{t')dt' 
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R- R+=0 ^ ^ 



X exp 



{S2 - 1) e-^^'-''-''h+{t')dt' 



P{Rl , tll-Ro; ^0 



E 



J(S2 - l)e~^R^''~''^ + 1 



Si 



R- 



X exp 



t2 



(S2 - 1) / e-^«(*^-*')A+(t')rft' 



JST3t 



-ti 





r f' 


exp 


- 1) / 

jti 



P{Ri , til-Ro, to) 

t2 



(S17) 



-A^(fe-t')A+(t')(it' 



X exp 



(^1-1) 



to 



(S18) 



-(S2 - l)e-^«^*2"*«) + 1 



2 

X exp 



- Ro 




exp 


(^2 - 1) f 




Jti 



t2 



(S19) 



By repeated application of the method above, i.e. assuming a finite number of cell divisions, 
the generating function is given by 



X exp 



s-l) rA+(tOe-'«(*-*')2-^(*'*')(it' 



(S20) 



where t') = D{t', t) represents the number of cell divisions within the time interval [t, t']. 
The above formula is easily proven via induction over the number D{t,t') of cell divisions 
within [t, t']. 

We first compute the mean mRNA copy number, {R{t))R, with the help of the generating 
function Eqn. flS20p . Setting = — oo, i.e. including an infinite number of cell divisions, 
we find that 

G{s, t\Ro, -oo) = exp [(s - l)a(t)] (S21) 
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with a(t) defined by 



' — oo 

oo J to 

Note that D{t,t') = D{t,t") + D(t",t') and that introduced here is a generahzation of 
without cell division (cf. flS7|) ). 



As Eqn. ( 1S20P is the generating function of a Poisson process we obtain the mean of R{t) 
by calculating dsG\s=i 



a 



(t) = {R{t))n = (i?(to))R2-^(*«'*)e-^«(*-*») + f X+{t')e'^'^^'-''h-''^''''^dt' . (S23) 



To calculate the variance we use a generating function Q{s,t) which contains the joint 
probability function P{R,t; RojIq) representing the probability that at time t there are R 
molecules and at time to there are Ro molecules. Afterwards we determine the variances by 

{R{t)R{to))R = dse\s=i. (S24) 

We again assume that initially the process is Poissonian-distributed and use the notation of 
(IS20p . i.e. G{s,t\Ro,to) = (x + l)^°K{s,t). The generating function 6(s,t) is given by 

Ro R Ro R 



G{s,t\Ro,to) 



(«(to)(x+l))^° 



[X 



=ea(to)(^+l) 

(a; + l)ir(s,t)a(to)e°(*°)" 
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= [2-o(*o,t)(s _ i)e-^H(*-*o) + l]a{to) 



X exp 



Jto 



= [2-^(*°'*)(s - l)e-^H(*-*°) + l]a{to) exp[(s - (S25) 
It follows that 

{Rit)R{to))R = a,e|,=i = «(io)(2-^(*°'*)e-^H(*-*°) + (S26) 
Replacing to by t' we obtain for the time correlation 

{Rm{t'))R = a(t')(2-^(*''*)e-^«(*-*') + (i?(t))R) 

= (i?(0)i?2-^(*' *)e-^H(*-*') + (S27) 

II. PARTIAL CONTRIBUTIONS TO GENE EXPRESSION NOISE 

For a one-gene system (cf. Fig. 1 main text) whose gene product does not interfere 
with external parameters (e.g. expression for genes forming ribosomes do infer with the 

translation rate and thus with external parameters) the total variation af^^ can be written 
as a sum of the following contributions 



'tot 



(t] + (7%. (S28) 



The term (t| is the intrinsic noise contribution for fixed environmental conditions whereas 
the term (t| represents the effect of extrinsic noise on the mean expression level. The above 
expression for intrinsic noise is defined by 

= {{X{t) + {X{t))x - {X{t))x - {m)x,R,A,Bf)x,n,A,B (S30) 
((X(t) - {X{t))xf)^^^^^^^ + {{{X{t))x - {X{t))x,n,A,Bf)^^^^^ 

+ (2(X(t) - {X{t))x){{X{t))x - {X{t))x,R,A,B))x,R,A,B (S31) 

" V ' 

=0, after {.)x 

= {{X{t) - {X{t))xf)^^^^^^^ + {{{X{t))x - {X{t))x,nr)^^^^, 

+ {i{X{t))x,R - {X{t))x,R,Af)^^^ + {{{X{t))x,R,A - {X{t))x,R,A,Bf)^ (S32) 

= 4 + 4 + ^1 + 4 (S33) 



where the methods of Eqns. (1S30P and (lS3ip are apphed iteratively to the re- 
maining terms. The joint probabihty P{X{t); R{t)] A{t)] B{t)) can be written 
in terms of the following conditional probabilities, P{X{t); R{t); A{t); B{t)) = 
P{X{t)\R{t))P{R{t)\A{t),B{t))P{A{t))P{B{t)). The averaging over the processes is 
given by {X{t))x = J X{t)P{X{t)\R{t))dX{t) where / P{X{t)\R{t))dX{t) = 1. 
The resulting terms of Eqn. (]S33|) give the second moment contributions from translation, 
ax, transcription, a^, promoter activation, a\, and activator binding, a%. Contribution 
from extrinsic noise consist of extrinsic effects on the mean expression and an increase 
of intrinsic noise due to extrinsic factors. The above contributions can be calculated 
analytically as shown in the following. 



As mentioned in the main text, we assume a separation of the time scales of activator 
binding and promoter activation, such that we assume an equilibrated activator binding 
process. To make the calculations feasible, we additionally assume a fixed induction level of 
the activator. Beg. We use the following abbreviations, where To is the generation time, a 
denotes either X or R, (3 represents either A or E and n G N. 



Qx = $${A) (S34) 
A = (A)5e,A+A+ (S35) 

Z^a {Tg) = — r—T^ (b36) 

1-2 "exp[-nA„TGj 

" 1-2 "exp[-n7^rGj 

7 (T \ l-exp[(A-+7^)rG] 
1 - 2 iexp[(A„ +i})Tg] 
The expression levels within a large population are calculated after infinitely many cell 
divisions. The cell division times are given by t^; by i = we define the last cell division that 
has occurred. Furthermore, we denote by and tf the time just before and after the cell 
division, respectively. Given the n-th moment of the stochastic function for mRNA, fi^^\t), 
and protein number, fi^^\t), the following invariance conditions can be derived 

^^^\t) = fiPit + kTa) V keZ (S39) 
l_i^]^\t) = ^P{t + kTc) V keZ. (S40) 
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Before the next cell division at time ti, the mean mRNA and mean protein number of a 
large population is given by 

{X(t))x,R,A = {X{t-tt))x,R,A + le-'--^'-'''\X(t^))x,R,A 



= {X{t-4))x,R,A + 



1 - 



e-^~-^'-''\X{TG))x,R,A (S41) 



{R{t)) 



R,A 



m-tt))R,A + \e-^-^'-'°\R{t,)) 



R,A 



{R{t-tt))R,A + 



1 - 



e-'~-^'-'°HR{TG))R,A. 



(S42) 



Here X{t — f^) represents the amount of protein resulting from synthesis in an actual gen- 
eration, whereas Xlt^) is the amount of protein in the mother cell just before cell division. 
The precise expressions for X{t — t^) and R{t — t^) are given by 



A 

{X{t - to ))x,R,A = -rrr 

A 



R 



J_ (i _ (>-^x(t-4)] _ 
A V 



{R{t-4))n,A = {A)B,M [l - e-^H(*-*^)l 

^R 



1 _ 9-17(1) , 
^R ~ 



We obtain the expressions for the mean amount of protein and mRNA given in Eqns. (13), 
(14) in the main text. 

The transcriptional contribution is given by 



{i{Xit))x - {Xit))x,Rf) 



R,A 



\ / I{.A 



(S43) 
(S44) 



Ti 



+ 2 r {{X{4))x\xif) - (X(4))x,H(Ax(t'))«>«^e-^-(*-*'')e-^-(*-*')di' (S45) 

Jto 



T2 



+ 



f f {\x{t')Xx{t") - {Xx{t'))R{Xx{t"))R)n,A e-'-^*-*")e-^x(*-*')di'di" . (S46) 

Jto Jto 



T3 



Prom the invariance t ^ t + Tg we get an expression for the first term 



T,{t-to) = lZMM+Z3(Me-2Ai(*-*o). 



4 1 _ ig-^^x^G 



(S47) 
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Correlations between actual and previous generations are defined by 



a; 



1 - e"^^ 



1 _ Ae-(^«+^x)^G 
4 



Ar + Aj(: 
1 



1 _ e~('^fl+-^x)^G 



(S48) 



Aii - Ajf 



-2A-(t-to) _ „-(A^+A^)(t-to) 



Noise from the new generation is given by 

Ts{t-to) = 2Qx5e, 1(1-2-^4')) 
A 



Ai 



+ /'g-2A^(t-to) _ g-{Ai+A^)(t-to)'\ 

Ad — Xv V ^ 



-2X-{t-to) _ ^-\R{t-t 



Afl - '^^x 



Ar - AjY 
AjfAj 



+ 



^fl(t-to)^ (S49) 

2(A« + A3,)V ; 



-2A-(t-to) _ p-(A3^+A^)(t-*o) 



(A^)2 - (A^)2 

The major steps of the calculations to obtain T2{t — to) and T3{t — to) are shown in the 
Appendix. 



The partial contribution to gene expression noise arising from promoter activation 

is given by 



{i{Xit))x,R-{Xit))x,R,Ay 



= {{X{ti))lR - {X{tt))%^^^)^e-''-- 



2X-(t-to) 



(S50) 
(S51) 



+ 2 /\(X(t+))x,i^(Ax(0>i^-(^(4)>x,i.,A(Ax(0)i^,^>^e-^-(*-*°)e-^-(*-*')dt' (S52) 



J2 



t ft 



+ I I ((Ax(O>«(Ax(i'0)ii-(Ax(i0)i?,A(Ax(i'0)ii,A>^e-^-(*-*'')e-^-(*-*')di'dnS53) 



^3 
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Term J3 can be split up into 



t rt rt' rt 



Mt - to) 




D{t',T') 



e-^fl(*'-^') 



D(t",T") 



xe R 



X-R{t"-T")^-\-^{t-t')^-X-^{t-t") 




dT"dT'dt"dt' 



[.]dT"dr'dt"dt' + 




in ''tn —00 J —00 



[]dT"dT'dt"dt' 



h 



[\dT" dT' df di! ^ 








t nt' rt' rt" 



I 

Wdr" dr' df di!^ (S54) 



/4 



and equivalently term J2 reads 



t rtJ rt' rt 



J2{t-to) 





tn J —00 J —00 J —OD 



D{t',r') 

2j ' 



,-7Ak'-r"| / £^ ^-A«(t'-r') ' ^ 



D{t",T") 



/T\Dit+,t") 
^^-X-{t"-T")^-X^{t-t')^-\^{t-t"} { i \ 



with 



= w(A)(Ai)^(A+)X=^^l 



dT"dr'dt"dt[S55) 



(S56) 



The term J2(i — to) is calculated for the asymptotic cases of interest whereas J3{t — to) can 
be given by the formulas for Ii — I4. The explicit expressions to obtain the single terms 
for are derived in an equivalent way to the transcriptional contribution (see Appendix). 



Calculations lead to the explicit expression for term Ii, i.e. 

1 



2Ax-(7A + A^) 



(7A+Ajj)(t-to) _ -2X^it^to) 



+ 



with 01 given by 



Ax- 7a 
1 

1 

Ax - A^ 



01 



-(7A+A^)(t-to) _ ^-2\-(t-to) 
■^_2A-(t-to) _ g-2A-(t-to) 

g-(A^+A3^)(t-to) _ g-2A^(t-to) 



Zra- 



2(A],)2-7l ""A3, -A 



(S57) 



(S58) 



R 
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Term I2 is given by 

1 f 1 1 



4 1 + 7a 
1 



e-(A^+7A)TG _ e-2A- Tg 



X 



2(Ax - A«)2 



g-A^(t-to) _ g-A^(t-to) 



Term /a is calculated to be of the form 

1 1 



A^ - 7A 2A^ - (7A + A^) 



-(Afl+7A){t-to) _ -2A^(t-to) 



+ 



Ax ~ A^ 



2(Ax - A^) 



Ax - A^ A^ - 7a 

g-2A^(t-to) _ g-2A^(t-to) 



^ g-(A^+A^)(t-to) _ g-2A3^(t-to)j 



with 



^3 = 



H^-Ry-ii 

The explicit formula for term /4 is given by 



= j t f r []dT"dT'dt"dt' 

Jt^ Jt^ Jt^ Jt^ 
ft ft' ft" ft" ft ft- ft- ft- 

= / / / / [.]dT"dT'dt"dt'+ / / / / iW'dT'dt 
Jtt Jtt Jtt Jt+ Jt+ Jt+ Jt" Jt+ 



dt' 



■-0 ""O ""O ''0 



H2 



with 

Hi -- 



Ar + 7a 



1 



X 



+ 



(S59) 



(S60) 



(S61) 



(S62) 



'^^r{^x + Ai?) \2A^ 
1 



+ 



]^ _ g-2Ajy(t-to) 
1 



Ax ~ A^ 



1 



g-(A^+A^)(t-to) _ g-2A^(t 
1 



-to) 



2Aj? (A;^ - A^) (A^ - 7a) (A;^ - 7^) A^ - 7a A;^ - A^ 
1 



Ax ~ A^ 



1 



g-{A^+A^)(t-to) _ g-2A^(t-to) 
1 1 



1 



2Ai?(A^ - A^) A^ - 7a A^ - A^y 2(A^ - A^) 
1 1 



-2A^(t-to) _ -2A3^(t-to) 



(Ar - 7a)(A^ - 7a) 2A^ - (7a + A^) 



g-(7A+A^)(t-to) _ g-2A^(t-to) 



(S63) 
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and 



Ho 



X 



+ 



1 1 



1 



1 1 \ 1 



\ _ g-2A^(t-to) 



1 I \ 1 



g-(7A+A3^)(<-io) _ ^-2\-{t-to) 



f.-{>'R+lAKt-to) _ ^-2\-{t-to) 



+ 



1 1 \ 1 



(^fl+^x)(*-*o) _ p-2A3,(t-to) 



(S64) 



III. ASYMPTOTIC ANALYSIS OF THE PARTIAL CONTRIBUTIONS 

A. Case I: Short mRNA lifetime, long protein lifetime (A^)~^ <^Tg <^ (■^x)"^ 

This is the most physiological case and we obtain the asymptotic expressions for the 
mean molecule number within on cell cycle given in Eqns. (15), (16) in the main text. The 
transcriptional contribution can be split up into the correlations between the actual and 
previous generations (T2) and the noise from the new generation (T3) and we obtain for the 
asymptotic case (A^)~^ ^ Tq 

1 

'{^rY 

and 



(S65) 



Ts{t - to) = 2^-^ 

y^R) 



(S66) 



Including the contribution from Ti and assuming t — to ^ (-^r)^^) obtain for the tran- 
scriptional contribution to gene expression noise the expression of Eqn. (19) in the main 
text 

The single expression for the contributions from promoter activation are given by 



j^{t - to) = A 



2var(A) 



{A}' (A^)2-(7a)^ 



[1 - e--(*-*o)] 



lA 



{^-Rf 



(S67) 
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and 



J3{t-to) = 2(PH2 



1 



1 

7a 



(t-to)-3-(l-e-^^(*-*°)l 



(S68) 



Assuming t — to ^ (ta)"^, {Xr)~^ and including the contributions from Ji, we obtain the 
noise contribution from promoter activation given in Eqn. (20) in the main text. 



B. Case II: Long mRNA and protein lifetime (A^) ^, {X-^) ^ S> 



The terms for the transcriptional contribution read 



^2(^-^0) = -AX+T^it-to). 
and noise from the new generation is given by 



(S69) 



The contributions from promoter activation read 



and 



with 



J2{t - to) = A'^-^fj^^ \ 8TUt - to) + —Tait - to) 



4 
37a 



1 / ^ 1 — p-'y^'^G 



1 _ Ip-1aTg 



Jsit-to) = 2A 



,var{A) 



(/l + l2 + h + h) 



h 
h 

h 
h 



1 



Z^I^{t-tof 



47^ 
1 

67^1 



27b- 



.(1) 



[t - tof 



27i 



hl^ ^ lA 2 / 



{t - tof {t - tof 



37A 27i 



(S70) 



(S71) 



(S72) 



(S73) 
(S74) 

(S75) 
(S76) 



In the asymptotic hmit (for Tq, {t — to) ^ 7^^) we obtain Eqns. (23) and (24) in the 
main text for the noise due to transcription and promoter activation, respectively. 
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C. Case III: Short mRNA and short protein hfetime (A^)"\ (A^)"^ < Tq 

The mean amounts of protein and mRNA are given by Eqns. (25) and (26) in the 
main text. The terms for the transcriptional contribution are given in the hmit 

Tg » (A«)-^ (A^)-i by 

2 . Xi 



T2{t - to) = iA-——^^-—— e"2^-(*-*») - e-(^H+^-)(*-*°) ) . (S77) 
-^ijllA^j - [Ax) ) ^ 



and 

A1 

Tsit — to) = QxBeq 

A 



Afi - 2A^ 



2L f e-2A3^{t-to) _ g-A^(i-io) 



_^ X |'g-2A^{t-to) _ g-(A3^+A^)(<-<o) 

A^ - Ax 



Ax _ ^_2A-(i-fe) 



2 (Aft + A 



Xy 



_ ^k^|_^re-2A-(^-^o)_g-{A^+A-)(t-to)^ j_ ^g^g) 
(Aij) - (Ax) ^ ^ J 

The only contribution from promoter activation derives from the I4 term 
(Ji = J2 = /a = 0). As the fluctuations decay very fast we also have no contribution 
from previous generations (Ji = 0) and no correlations between the previous and actual 
generation (J2 = 0). 



We obtain 

T (f f \ - o.42 ^«K^) . A'variA) f A^ ■ 

~ - -JaT'^ - (A)^A^,Ax(7. + A«)(A, + Ax) + ^ITaJ, ' ' ^^''^ 

Eqns. (27) and (28) of the main text are obtained in the limit t — to ^ (7^1) 



IV. SIMULATING PARTIAL NOISE CONTRIBUTIONS 



We investigate the partial contributions (terms of Eqn. flS33P ) to gene expression noise 



by the following simulation protocol. If we assume a constant activator B^q = const, there 
are no contributions from activator binding. In the more general case, i.e. if B{t) 7^ const, 
it can be easily included in the simulations. 

As mentioned before, we can also neglect the noise contribution from translation. The 
remaining terms (transcription) and (promoter activation) only differ in their averaging. 

In order to calculate the partial contributions to gene expression noise, a correct averag- 
ing of the simulation results is necessary: 
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Let g,m be the number of simulations of promoter- on/- Oj^ trajectories and mRNA tra- 
jectories, respectively. Each protein trajectory has a unique promoter- on/- Oj(f and mRNA 
trajectory, so Pij is the protein trajectory resulting from the ith promoter- on/- Oj^^ and the 
jth mRNA trajectory and can be identified with {X{t))x- 

Averaging over all mRNA trajectories belonging to one promoter-on/-Oj(f trajectory, we 
obtain the mean protein trajectory belonging to the ith promoter- on/- Oj(f realization, i.e. 

Averaging over all mRNA trajectories which were obtained from all promoter- on/- Oj^ 
trajectories we have {X{t))x,R,A = ^ Ef=i ^ YlJ=iPij- 

We obtain for the second moment of transcription (cr^) and promoter activation (cr^) of 
gene expression noise on the protein level: 

9 ^ in m 



q ^ — ' m ^ — ' m 



a 



^ i=i j=i ^ i=i j=i 

To illustrate that this averaging method mimics the correct formulas, if mRNA and 
protein degradations are large compared to the generation time {case III), i.e. the system 
equilibrate very fast after cell division. Table [ST] summarizes the mean amount of mRNA and 
protein and the noise contributions from transcription and promoter activation (averaged 
over the last half of the cell cycle). If, however, one of these two rates is small compared 
to the generation time {case I and //), the simulation/averaging routine becomes much 
more complex and very time consuming. To assure that the analytical expressions are 
correct, we calculated some values of the integral representation of the partial contributions 
(Eqns. flS52p . flS53p ) with Mathematica and compared them to the explicit formulas (Eqns. 
(IS67I) .( IS7T1) ) and ( 1S57I) - (1S64I) ). For case I and //, the precise formulas evaluated at three 
different time points coincide with the integral representations at these time points. 
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V. APPENDIX 



We derive the explicit formulas for the transcriptional contribution, i.e. Eqns. (1S45|S46P 
and give some calculations leading to flS48p and flS49p . The contribution from promoter 
activation is obtained in an equivalent way. 
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analjdiics 


simulations 




5.33 


5.31 ±0.03 




106.67 


106.37 ± 0.39 




0.1071 


0.1074 ±4.6 X 10"^ 




0.2402 


0.2443 ± 2.7 X lO"''^ 



TABLE SI: Comparison of the analytical and simulation results of case III with the kinetic 
parameters AJ^ = 0.2, = 0.3, A^ = 2, A^ = 0.15, A^ = 4, A^^ = 0.2 and B^q = 1. The cells divide 
after 100 minutes. 



A B 
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FIG. SI: Comparison of analytical results (black lines) with simulations (red lines). (A) mean 
amount of mRNA and (B) mean amount of protein. The kinetic parameters are chosen from case 
I, when mRNA mRNA lifetime is short and protein lifetime is long compared to the generation 
time, i.e. \\ = 0.2, = 0.3, A^ = 2, A^ = 0.15, Aj^ = 4 and A^ = 1 x 10~^. The activator is 
constantly bound, i.e. B^q = 1, but the same results are also obtained if B^q = q < 1 (simulations 
not shown). We fix the cell cycle to io = 0, T = 50 (min) and sampling frequency to dt = 0.2. To 
obtain a good agreement between simulations and analytics, we simulate and average over 100 cells, 
each with 200 promoter-on/-off trajectories. For each promoter-on/-off trajectory we generate 200 
mRNA trajectories with the corresponding protein trajectories. 
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